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MAINTAINING ANNOTATIONS FOR DISTRIBUTED AND VERSIONED FILES 

CROSS-REFERENCE TO RELATED APPLICATIONS 

[0001] The present application is related to commonly-owned U.S. Pat. No. 
6,519,603, entitled "Method And System For Organizing An Annotation Structure 
And For Querying Data And Annotations", commonly-owned, co-pending 
applications 10/083,075, entitled "Application Portability And Extensibility Through 
Database Schema And Query Abstraction," 10/600,014, entitled "Universal 
Annotation Management System", 10/310,192, entitled "Annotation Validation Using 
Partial Checksums", 10/600,382, entitled "Heterogeneous Multi-Level Extendable 
Indexing For General Purpose Annotation Systems," and Attorney Docket No. 
ROC920030291US1, entitled "Dealing With Annotation Versioning Through Multiple 
Versioning Policies and Management Thereof, filed herewith, which are herein 
incorporated by reference in their entirety. 

BACKGROUND OF THE INVENTION 
Field of the Invention 

[0002] The present invention generally relates to annotating data objects and, 
more particularly, to maintaining annotations across different versions of data 
objects. 

Description of the Related Art 

[0003] An annotation system is one where descriptive information is stored about 
objects, or parts of objects, as an annotation. Some annotation systems store 
annotations separately, without modifying the objects themselves. An annotation 
store, typically a database, contains information for the annotation, typically in the 
form of descriptive text, or other classifiers. An indexing scheme is typically used to 
map each annotation to the object or positions of annotated portions within the 
object. The annotation system may be configured to display the annotation in 
response to the user selecting the annotated portions of the object. For example, 
one common method for displaying annotations is in a flyover text box generated 
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when a user positions a mouse pointer in proximity to the annotated portion or 
portions of the object. 

[0004] The annotations themselves may serve different functions for different 
applications. For example, for some applications, annotations may serve as an 
interactive help system, providing description of the functionality, or explain the use 
of the annotated object. As an example, in a business application, an annotation 
may explain the use of an economic performance parameter, such as a current 
year's revenue by division. For other applications, annotations may serve to provide 
expert feedback, comments, or clarifications from a reviewer or a creator of a 
document or other data object. As examples, the creator of a document may insert 
an annotation highlighting the relevance of a certain piece or group of data, while a 
reviewer of the same document may insert an annotation with instructions for 
revising the document, possibly to correct an error. 

[0005] A challenge is presented, however, when an annotated document is 
changed, for example, creating a new version of the document. A policy should be 
established that determines how annotations should be maintained for different 
versions of a document. One approach is to apply a "no-change" policy on the 
document, whereby annotations are maintained for the original document only and 
do not apply to subsequent versions. While this approach is simple, there may be 
many cases where at least some annotations would apply to a subsequent version 
of a document, such as when the particular portion described by the annotation has 
not changed. Therefore, another approach would be to "carry-forward" annotations 
to subsequent versions of a document. However, in some cases, some of the 
annotations may no longer apply, such as when an annotated portion of the 
document is edited to render the annotation invalid or deleted entirely. Further, even 
if the annotation does apply to the new version, annotation indexes indicating the 
corresponding annotated portions of the document may need to be updated to 
reflect the changes in the document. 

[0006] Thus, each of these policies has drawbacks and using either alone 
provides a less than ideal solution. Accordingly, there is a need for an annotation 
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system that supports multiple policies for maintaining annotations after changes to 
an annotated source document, for example, allowing an annotation author to select 
one or more policies that best fit a particular situation. 

SUMMARY OF THE INVENTION 

[0007] The present invention generally is directed to a method, system and 
article of manufacture for annotating multiple versions of a document. 

[0008] One embodiment provides a method for managing annotations. The 
method generally includes detecting one or more changes to a document having at 
least one annotation corresponding to at least one portion of the document prior to 
occurrence of the one or more changes, determining if the at least one annotation 
should be applied to the document as changed, and if so, updating an annotation 
record containing the at least one annotation to reflect the one or more changes to 
the document. 

[0009] Another embodiment provides a method generally including receiving a 
document to be checked into a content management system, the document having 
at least one annotation corresponding to an annotated portion thereof, wherein one 
or more changes to the document have been made subsequent to creation of the 
annotation, determining if the annotation should be applied to the document as 
changed, and if so, updating an annotation record containing the at least one 
annotation to reflect the one or more changes to the document. 

[0010] Another embodiment provides a computer-readable medium containing a 
program. When executed by a processor, the program performs operations 
generally including detecting one or more changes to a document having at least 
one annotation corresponding to at least one portion of the document prior to the 
one or more change, determining if an annotation corresponding to at least one 
annotated portion of the document prior to the change should be applied to the 
document as changed, and if so, updating an annotation record containing the at 
least one annotation to reflect the one or more changes to the document. 
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[0011] Another embodiment provides a system generally including at least one 
application, a content management system for managing a plurality of documents 
manipulated by the at least one application, an annotation database for holding 
annotation records, each containing annotation data related to one or more of the 
plurality of documents, and an annotation maintenance component. The annotation 
maintenance component is generally configured to detect changes to a document 
managed by the content management system, the document having at least one 
annotation corresponding to an annotated portion thereof, wherein one or more 
changes to the document have been made subsequent to creation of the annotation, 
determine if the annotation should be applied to the document as changed, and, if 
so, update an annotation record containing the at least one annotation to reflect the 
one or more changes to the document. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0012] So that the manner in which the above recited features, advantages and 
objects of the present invention are attained and can be understood in detail, a more 
particular description of the invention, briefly summarized above, may be had by 
reference to the embodiments thereof which are illustrated in the appended 
drawings. 

[0013] It is to be noted, however, that the appended drawings illustrate only 
typical embodiments of this invention and are therefore not to be considered limiting 
of its scope, for the invention may admit to other equally effective embodiments. 

[0014] FIG. 1 is a computer system illustratively utilized in accordance with 
embodiments of the present invention. 

[0015] FIG. 2 is a relational view of software components according to one 
embodiment of the present invention. 

[0016] FIG. 3 is a table illustrating exemplary policies for maintaining annotations 
for multiple versions of a document. 
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[0017] FIGs. 4 illustrates an exemplary graphical user interface (GUI) screen 
according to one embodiment of the present invention. 

[0018] FIG. 5 is another relational view of software components according to one 
embodiment of the present invention. 

[0019] FIG. 6 is a flow chart illustrating exemplary operations for maintaining 
annotations for multiple versions of a document according to one embodiment of the 
present invention. 

[0020] FIG. 7 is a flow chart illustrating exemplary operations for enforcing 
annotation maintenance policies according to one embodiment of the present 
invention. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

[0021] The present invention provides methods, systems, and articles of 
manufacture for maintaining annotations for changing source documents. For some 
embodiments, an annotation system may support multiple policies, with each policy 
dictating how (or if) an annotation created for a version of a source document should 
be applied to subsequent versions of the source document. A user creating an 
annotation may select from a group of available policies to be applied to the 
annotation. For some embodiments, policies may be enforced when annotated 
documents are checked into a content management system used to manage the 
annotated documents. 

[0022] As used herein, the term document generally refers to any type of content 
containing data object including, but not limited to, text documents, spreadsheets, 
schematics, images, media files, and the like. The term version, when applied to a 
document, merely refers to the content of the document at a corresponding time and 
does not imply that multiple versions of a document necessarily exist at any time. In 
other words, a subsequent version of a document may exist without of previous 
version still existing. Further, there is no implication that different versions of a 
document will have a common file name with version appended (e.g., v2, v3, etc.). 
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Accordingly, a document modified and saved with an entirely different name may still 
constitute a different version of the document. 

[0023] As used herein, the term annotation generally refers to any type of 
descriptive information associated with one or more data objects. Annotations may 
exist in various forms, including textual annotations (descriptions, revisions, 
clarifications, comments, instructions, etc.), graphical annotations (pictures, 
symbols, etc.), sound clips, etc. While an annotation may exist in any or all of these 
forms, to facilitate understanding, embodiments of the present invention may be 
described below with reference to textual annotations as a particular, but not limiting, 
example of an annotation. Accordingly, it should be understood that the following 
techniques described with reference to textual annotations may also be applied to 
other types of annotations, as well, and, more generally, to any type of reference to 
a data object. 

[0024] As used herein, the term change detection value generally refers to any 
numerical, derived, or filtered value, regardless of length, generated by an algorithm 
performed on portions of data to encode, detect changes in, or validate 
completeness of the portions of data. Any suitable algorithm may be used to 
generate the change detection value, including, but not limited to, simple summing 
algorithms and more complex algorithms, such as cyclic redundancy check (CRC) 
algorithms and MD5 filtering functions commonly used to generate checksums. 
Further, a change detection value may be larger than the portions of data or the 
portions of data may be used as a change detection value. While a change 
detection value may be all of these, to facilitate understanding, embodiments of the 
present invention may be described with reference to a checksum as a particular, 
but not limiting, example of a change detection value. Accordingly, the following 
references to checksums may be applied to change detection values in general. 

[0025] Further, as used herein, the term user may generally apply to any entity 
utilizing the annotation system described herein, such as a person (e.g., an 
individual) interacting with an application program or an application program itself, 
for example, performing automated tasks. While the following description may often 
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refer to a graphical user interface (GUI) intended to present information to and 
receive information from a person, it should be understood that in many cases, the 
same functionality may be provided through a non-graphical user interface, such as 
a command line and, further, similar information may be exchanged with a non- 
person user via a programming interface. 

[0026] One embodiment of the invention is implemented as a program product 
for use with a computer system such as, for example, the enterprise system 100 
shown in FIG. 1 and described below. The program(s) of the program product 
defines functions of the embodiments (including the methods described herein) and 
can be contained on a variety of signal-bearing media. Illustrative signal-bearing 
media include, but are not limited to: (i) information permanently stored on non- 
writable storage media (e.g., read-only memory devices within a computer such as 
CD-ROM disks readable by a CD-ROM drive); (ii) alterable information stored on 
writable storage media (e.g., floppy disks within a diskette drive or hard-disk drive); 
or (iii) information conveyed to a computer by a communications medium, such as 
through a computer or telephone network, including wireless communications. The 
latter embodiment specifically includes information downloaded from the Internet 
and other networks. Such signal-bearing media, when carrying computer-readable 
instructions that direct the functions of the present invention, represent embodiments 
of the present invention. 

[0027] In general, the routines executed to implement the embodiments of the 
invention, may be part of an operating system or a specific application, component, 
program, module, object, or sequence of instructions. The software of the present 
invention typically is comprised of a multitude of instructions that will be translated 
by the native computer into a machine-readable format and hence executable 
instructions. Also, programs are comprised of variables and data structures that 
either reside locally to the program or are found in memory or on storage devices. 
In addition, various programs described hereinafter may be identified based upon 
the application for which they are implemented in a specific embodiment of the 
invention. However, it should be appreciated that any particular nomenclature that 
follows is used merely for convenience, and thus the invention should not be limited 
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to use solely in any specific application identified and/or implied by such 
nomenclature. 

AN EXEMPLARY ENVIRONMENT 

[0028] FIG. 1 illustrates a client-server view of an exemplary system 100 in which 
an annotation system in accordance with the present invention may be utilized to 
exchange information, captured in the form of annotations, between users 
collaborating on a project. For some embodiments, the annotation system utilized in 
the system 100 may be similar in operation to an annotation system described in the 
commonly owned, co-pending application 10/600,014, entitled "Universal Annotation 
Management System." 

[0029] As illustrated, the system 100 generally includes one or more client 
computers 102 (e.g., user workstations) and at least one server computer 104. The 
client computers 102 and server computer 104 may be connected via a network 
127. In general, the network 127 may be any combination of a local area network 
(LAN), a wide area network (WAN), wireless network, or any other suitable type 
network, including the Internet. 

[0030] As illustrated, the client computers 102 generally include a Central 
Processing Unit (CPU) 110 connected via a bus 108 to a memory 112, storage 1 14, 
input devices 116, output devices 119, and a network interface device 118. The 
input devices 116 may be any devices to give input to the client computer 102, such 
as a mouse, keyboard, keypad, light-pen, touch-screen, track-ball, or speech 
recognition unit, audio/video player, and the like. The output devices 119 may be 
any suitable devices to give output to the user, including speakers and any of 
various types of display screen. Although shown separately from the input device 
116, the output device 119 and input device 116 could be combined (e.g., a display 
screen with an integrated touch-screen. 

[0031] The network interface device 118 may be any entry/exit device configured 
to allow network communications between the client computer 102 and the server 
computer 104 via the network 127. For example, the network interface device 118 
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may be a network adapter or other network interface card (NIC). Storage 1 14 is 
preferably a Direct Access Storage Device (DASD). Although shown as a single unit, 
storage 1 14 may be any combination of fixed and/or removable storage devices, 
such as fixed disc drives, floppy disc drives, tape drives, removable memory cards, 
or optical storage. The memory 112 and storage 114 could be part of one virtual 
address space spanning multiple primary and secondary storage devices. 

[0032] The memory 1 12 is preferably a random access memory (RAM) 
sufficiently large to hold the necessary programming and data structures of the 
invention. While the memory 1 12 is shown as a single entity, it should be 
understood that the memory 112 may in fact comprise a plurality of modules, and 
that the memory 112 may exist at multiple levels, from high speed registers and 
caches to lower speed but larger DRAM chips. Illustratively, the memory 112 
contains an operating system 124. Examples of suitable operating systems, which 
may be used to advantage, include Linux and Microsoft's Windows®, as well as any 
operating systems designed for handheld devices, such as Palm OS®, Windows® 
CE, and the like. More generally, any operating system supporting the functions 
disclosed herein may be used. 

[0033] The memory 1 12 is also shown containing at least one application 120 
(optionally shown with an associated annotation plug-in 122 and an annotation 
broker 128). The application 120 may be any of a variety of applications used to 
manipulate (e.g., create, view, and/or edit) documents 117. For example, the 
application 120 may be a text editor/word processor used to manipulate annotatable 
documents, a document generator/viewer (such as Adobe's Acrobat ® and Acrobat 
Reader) used to manipulate documents, or data analysis software, such as Decision 
Site available from Spotfire, Inc., imaging software used to manipulate images, and 
any other types of applications used to manipulate various types and forms of data 
that may be embodied in documents 117. The documents 117 may be stored in a 
separate data store than annotations describing them. Different versions of 
documents 117 may exist at any given time. As described herein, a number of 
different "annotation versioning" policies may be available to determine how 
annotations originally made for one version of a document 117 may be applied to 
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subsequent versions of the same document 117. 

[0034] As illustrated, for some embodiments, documents 1 1 7 manipulated by the 
application 120 may be managed by a content management system 170. In such 
embodiments, enforcement of annotation versioning policies may be coordinated 
with the content management system 170 (e.g., via an annotation plug-in 172), for 
example, when modified documents (checked out from the content management 
system 170) are checked back in. By providing a central location from which 
annotated documents 1 17 are accessed, the content management system 170 may 
greatly facilitate annotation versioning policy enforcement. 

[0035] Some application programs 120 may be configured to communicate with 
the annotation server 140 directly, for example, via a set of application programming 
interface (API) functions (not shown) provided for the annotation server 140. As 
used herein, the term API generally refers to any set of interface functions (e.g., 
implementing any suitable inter-process protocol) that may be used to communicate 
between a client computer or process and a server computer or process. Other 
application programs, however, may communicate with the annotation server 140 
via plug-in components 122 and/or the annotation broker 128 (e.g. also via API 
functions). In other words, annotation capability may be added to an existing 
application 120 via the plug-in components 122. The plug-in components 122 may, 
for example, present graphical user interface (GUI) screens to users of applications 
120, thus allowing the creation and retrieval of annotations from within the 
applications used to manipulate the annotated data. 

[0036] The annotation broker 128 is an optional component and may be 
implemented as a software component configured to present a standard interface to 
the annotation server 140 from various applications 120, for example, 
communicating with plug-in components 122 from multiple applications running on 
the same client computer 102. Hence, the annotation broker 128 may provide a 
degree of separation between the applications 120 and the annotation server 140, 
hiding detailed operation of the annotation server 140 and facilitating development of 
plug-in components 122. In other words, new applications 120 may be supported 
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through the development of plug-in components 122 written in accordance with the 
annotation broker interface. 

[0037] Components of the server computer 104 may be physically arranged in a 
manner similar to those of the client computer 102. For example, the server 
computer 104 is shown generally comprising a CPU 135, a memory 132, and some 
type of storage system, such as a database management system (DBMS) 154, 
coupled to one another by a bus 136, which may all functions as similar components 
described with reference to the client computer 102. The server computer 104 is 
generally under the control of an operating system 138 (e.g., IBM OS/400®, UNIX, 
Microsoft Windows®, and the like) shown residing in memory 132. As illustrated, for 
some embodiments, the server computer 104 may be configured with a content 
management system 170, such as IBM's Content Manager, generally configured to 
manage documents 117. 

[0038] As illustrated, the server computer 104 may also be configured with the 
annotation server 140, also shown residing in memory 132. The annotation server 
140 provides annotation clients (e.g., running on one or more client computers 102) 
with access to the annotation store 139, for example, via annotation API functions. 
In other words, the annotation API functions generally define the interface between 
annotation clients and the annotation server 140. As used herein, the term 
annotation client generally refers to any user interface (or other type front-end logic) 
of the annotation system that communicates with the annotation server to 
manipulate (e.g., create, update, read and query) annotation data. Examples of 
annotation clients include applications 120 communicating with the annotation 
server 140 (directly, or via plug-in components 122) or an external application, such 
as an annotation browser (not shown). 

[0039] The annotation server 140 may be configured to perform a variety of 
operations, such as responding to requests to create annotations for specified data 
objects, formulating and issuing queries against the annotation store 139 to search 
for annotations for a specified data object, and formulating and issuing queries 
against the annotation store 139 to search for annotations satisfying one or more 
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specified conditions (e.g., having a specified author, creation date, content, and the 
like). The annotations may be contained in annotation records 150, for example, 
stored in an annotation database 139. 

[0040] The annotation records 150 may also contain various information about 
the annotation, such as the author and creation date of the annotation, as well as an 
index identifying annotated portions of documents 117. For some embodiments, the 
annotation server 140 may include an indexing component (shown in FIG. 2) 
configured to generate an index for an annotated data object, for example, based on 
one or more parameters identifying the annotated data object (e.g., a database 
table, row, and/or column). Indexes created for annotated portions of documents 
117 may be stored in an index table 152 in the annotation data store 139. For some 
embodiments, the index table 152 may be queried to identify annotations for 
documents 117. 

[0041] As illustrated, the annotation server 140 may have an associated 
maintenance component 144 generally configured to maintain annotations for 
documents 117. As will be described in greater detail below, the annotation 
maintenance component 144 may play a role in implementing and enforcing policies 
for maintaining annotations when changes are made to annotated documents 117. 

[0042] FIGs. 2 is a relational view of software components, including the 
annotation maintenance component 144, according to one embodiment of the 
present invention, shown during annotation generation. As illustrated, an annotation 
153 for a portion of a document 117 (identified by a set of ID parameters) may be 
created via an annotation generation component 143. An indexing component 142 
may create an index 151 based on the set of ID parameters, for use in indexing the 
annotation 153 created for the identified data object. In general, an index 151 points 
to the annotated portion or position within a corresponding annotated document 117. 

[0043] A corresponding entry for the index 151 may be placed in the index table 
152, allowing the index table 152 to be queried to identify documents and portions of 
documents having corresponding annotations. For some embodiments, entries in 

the index table 152 may simply contain ID parameters indicating an annotated data 
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object (e.g., identification of a data source/table, a row, and column). For other 
embodiments, however, table entries may include index parameters generated 
based on the ID parameters. Such indexing techniques are described in the 
commonly assigned, co-pending application 10/600,382, entitled "Heterogeneous 
Multi-Level Extendable Indexing For General Purpose Annotation Systems," filed 
June 20, 2003. 

[0044] The annotation maintenance component 144 may also generate a change 
detection value 155 (e.g., a checksum or hash value, such as MD5) for the 
annotated document or annotated portion of the document, to be stored in the 
annotation record 150. The annotation maintenance component 144 may detect 
changes in a document 1 17 or portion of a document 117 associated with an 
annotation by generating a new change detection value and comparing it to the 
change detection value stored in the corresponding annotation record 150. The 
generation and use of change detection values is described in detail in the 
commonly assigned, co-pending Application 10/310,192, entitled "Annotation 
Validation Using Partial Checksums." In response to detecting a change to an 
annotated document 117, the annotation maintenance component 144 may delete, 
modify (e.g. update an index 151), or maintain the annotation record 150 as-is, 
depending on a type of policy selected for maintaining annotations for different 
versions of documents 117. 

EXEMPLARY ANNOTATION VERSIONING POLICIES 

[0045] FIG. 3 shows a table 300 illustrating exemplary policies for maintaining 
annotations for multiple versions of a document (hereinafter "annotation versioning 
policies" or simply "policies"). The policies may be applied at the annotation level, 
the document level, or some combination of both. In other words, in some cases, 
annotation policies may be set from within a document (or application used to 
manipulate that document), and applied to all annotations created for that document. 
In some cases, annotation policies may be set when an annotation is created. In 
some cases, a default policy may be set at the document level, which may be 
changed when creating an annotation. 
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[0046] In any case, the first policy shown in table 300 is a "no-change" policy 
dictating that annotations stay with the original version of a document for which they 
were created and do not apply to subsequent versions. In other words, if a 
document is changed, the old annotation does not apply to the new document. 
Indexes to the original document apply to the annotation, while indexes in the new 
document do not. Content of the annotation may be changed, with the changed 
content applying to the same version document as the original content. 

[0047] The second policy shown in table 300 dictates that annotations are 
applied to multiple versions of a document. An annotation on an original document 
also points to subsequent (changed) versions of the document. In other words, a 
user would see the same annotation whether viewing the old or a newer version of 
the document. If a change is made to the annotation content, the change in 
annotation content will appear whether the old or newer version of the document is 
displayed. While this may cause confusion if the changes to the source document 
are large and varied, it may be a preferred policy in some cases. For example, there 
may be cases where an annotation captures a general insight regarding a topic 
addressed in an annotated portion. As such, the annotation is likely to remain valid 
regardless of changes to the annotated document or the annotated portion of the 
document. 

[0048] A third policy shown dictates that for each changed version of a 
document, each annotation is carried forward to the most accurate position in the 
document. In other words, multiple copies of annotation records 150 may exist for 
each annotation, with the index of each subsequent copy updated to reflect the 
changes between versions. For some embodiments, indexes may be updated 
automatically (e.g., an application may scan a document to identify an originally 
annotated portion in a subsequent version). In some cases, however, an originally 
annotated portion may not be readily identified in a subsequent version, due to the 
magnitude of changes or deletion. As a result, in some cases, this approach may 
lead to annotations being carried forward regardless of semantic and contextual 
correctness. 
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[0049] Therefore, for some embodiments, a user may specify that annotations 
must be validated (by someone with authority) before they are carried forward to 
new versions of a document. This policy (last in table 300) may be applicable in an 
environment where the person who makes the changes to the document also has 
authority to see and determine the applicability of a previous annotation to the 
changed document. This policy may be used in conjunction with the second or third 
policies. A related policy may dictate that annotations viewed from a different 
version of a document (e.g., other than the version for which it was created) should 
be marked as unverified indicating it was carried forward but has not been formally 
approved, effectively putting a viewer on notice they must determine whether the 
annotation still applies. An authorized user, such as the original author of the 
annotation, may re-verify the annotation, resulting in removal of the unverified 
marking. 

[0050] It should be understood that the listed policies are exemplary only and that 
different embodiments of annotation systems may support more, less, or different 
policies than those listed. Further, as will be described below, some of the listed 
policies may be combined. Policies may also be added and removed from the 
system, for example, according to customer or administrative needs. An 
administrator of the system may select one or more policies to make available to 
users during system configuration and may also indicate a default policy. Users 
may also be allowed to set their own policy from the set of available policies. In 
other words, a user creating an annotation may select from whatever policies are 
allowed to apply to their annotation. 

[0051] For example, FIG. 4 illustrates an exemplary graphical user interface GUI 
screen 440 that may be used to select a policy for applying an annotation to 
subsequent versions of the document. The GUI screen 440 may be accessed, for 
example, from an application 120 to set an annotation policy at the document level 
(e.g., to be applied to all annotations created for that document) or at an annotation 
level (e.g., to be applied only to an annotation), for example, when creating an 
annotation. 
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[0052] As illustrated, the GUI screen 440 may allow, the user (via radio buttons 
442, 444, and 445) to specify whether the annotation should be applied to all 
versions of the document (i.e., the second policy of table 300), carried forward to 
subsequent versions of the document (i.e., the third policy of table 300), or applied 
only to the original annotated version of the document (i.e., the first policy of table 
300), respectively. Further, a number of validation option may be specified if the 
annotation is applied or carried forward to subsequent versions. For example, a 
user may specify (via radio buttons 446, 447, and 448, respectively) that validation 
may be required prior to carrying the annotation forward to subsequent versions, 
that the annotation may be carried forward automatically and marked as unverified, 
or neither. As will be described in greater detail below, if user validation is required, 
the annotation maintenance component may be configured to notify the user (e.g., 
via e-mail, instant message, or other type of notification technique) in the event a 
change to the annotated document is detected. 

ANNOTATION VERSIONING POLICY ENFORCEMENT 

[0053] In some business enterprises, a content management system is used to 
manage a wide variety of content (generically referred to herein as documents) 
accessible to and accessed by a number of users. For example, referring back to 
FIG. 1, a content management system 170 may manage documents 117 accessed 
by a number of users of the system 100. In systems utilizing such content 
management systems, documents are checked out by users to be viewed or 
modified and checked back in when the user is through viewing or modifying the 
document. The content management system typically tracks changes and maintains 
multiple versions of the documents. 

[0054] For some embodiments, annotation versioning policies may be enforced 
in conjunction with a content management system (e.g., when an annotated 
document is checked back into the content management system). For example, 
FIG. 5 illustrates a relational view of software components of a system 500 in which 
an annotation maintenance component 144 enforces annotation versioning policies 
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as annotated documents 1 17 are checked back into a content management system 
170. 

[0055] Operation of the system 500 may be described with reference to FIG. 6, 
which illustrates a flow chart of exemplary operations 600 that may be performed, for 
example by the annotation maintenance component 144 and/or content 
management system 170, to maintain annotations for multiple versions of 
documents according to one embodiment of the present invention. For example, in 
some cases, the annotation maintenance component 144 may be integrated with the 
content management system 170, for example, as a plug-in component. 

[0056] The operations 600 begin, at step 602, by receiving a document 1 1 7, for 
example, by the annotation maintenance component 144, to be checked into the 
content management system 170. At step 604, a determination is made as to 
whether the document has changed, for example, by comparing a change detection 
value 155 stored in a corresponding annotation record 150 with a changed detection 
value calculated for the document 117. If the document 1 1 7 has not changed, it is 
checked into the content management system 170, at step 605. 

[0057] If the document 1 1 7 has changed, the document 1 1 7 is saved as a new 
version (or at least the revised document 1 17 is saved), at step 606. At step 608, a 
determination is made as to whether annotations are to be carried forward to the 
new version of the document 117. If not, the operations 600 are exited, at step 610. 
If annotations are to be carried forward to the new version of the document 117, new 
annotation records 150 for the new version of the document are created (or existing 
annotation records 150 are updated) at step 612. At step 614, the annotation 
indexes 151 are updated to refer positions in the new version of the document 117 
and a new change detection value 155 is generated for the new version of the 
document, at step 616. 

[0058] Depending on the annotation versioning policy selected (e.g., by the 

annotation author), the annotation maintenance component 144 may carry 

annotations forward to refer to their respective positions within a document 117 

when that document 117 changes. For example, after the revised document 117 
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has been saved in the content management system 170, the annotation 
maintenance component 144 may then create and store new annotation records 
(with indexes referring to the revised document 1 17) in the annotation store 139. As 
previously described, the annotation maintenance component may update the 
corresponding index 151 to reflect the revised document and update the change 
detection value 155 (illustratively shown as a checksum) to reflect revisions to the 
annotated document 1 17 or the corresponding annotated portion thereof. 

[0059] As previously described with reference to FIGs. 3 and 4A-4D, exactly 
when and how annotations are carried forward (e.g., steps 612-616 of FIG. 6) may 
depend on options a user selected when creating the annotation and whether a 
current user (is authorized . In some cases, annotation records may be 
automatically created/updated or user interaction may be required, in some 
instances. FIG. 7 illustrates a flow diagram of exemplary operations 700 that 
illustrate how and when an annotation may be carried forward and when user 
interaction may be sought, for example, based on options selected creator or author 
of the original annotation. 

[0060] The operations 700 begin, at step 702, by receiving a document 117. At 
step 704, a determination is made as to whether the annotation creator enabled 
annotation carry forward, for example, via check box 424 of FIG. 4B. If so, a 
determination is then made, at step 706, as to whether a current user (e.g., a user 
checking in the document 1 17) is authorized to carry forward annotations. For 
example, depending on the system settings, only users operating in certain roles 
may be authorized to carry annotations forward. If the current user is not authorized 
to carry annotation forward, the operations are exited, at step 708, for example, by 
checking in the document 1 17 without carrying annotations forward. 

[0061] If the user is authorized to carry annotations forward, annotations from 
previous versions of the document are retrieved, at step 710. As previously 
described, for some embodiments, the annotation maintenance component 144 may 
be configured to automatically update indexes to refer to annotated portions of a 
revised document. Therefore, at step 712, an algorithm is run to identify (previously) 
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annotated portions in the new version of the document 117. For example, the 
algorithm may employ pattern matching, searching the new version of the document 
1 17 for annotated portions from the previously annotated version of the document 
117. If the annotated portions are identified by the algorithm (step 714), the indexes 
are automatically updated, at step 720 and new change detection values are 
generated at step 722, prior to exiting the operations 700, at step 724. 

[0062] If the algorithm did not identify the annotated portions in the new version 
of the document 1 17, a user (e.g., the original annotation author or last user to 
modify the annotation) may be notified, at step 716. For example, the user may be 
given the opportunity to utilize a tool to manually identify the portions of the new 
version of the document 1 1 7 to which the annotations should be carried forward. 
For some embodiments, the tool may indicate to the user where the annotation may 
apply based on results of the algorithm (e.g., by highlighting text in the document 
117, such highlighted text 405 in FIG. 4A) and lets the user carry the annotations 
forward to the new version of the document 117. Input from the user (e.g., a manual 
selection indicating annotated portions in the new version of the document 117) may 
be received, at step 718, and used to update the indexes at step 720. 

[0063] As previously described, user input may also be required if the creator of 
the annotation specified that validation is required for carrying an annotation forward 
(e.g., by the annotation author or other authorized user). In any case, a user may be 
able to check out a document from the content management system 170 while the 
carrying forward of annotations related to the document is pending another user's 
actions to carry it forward (e.g., validation or identification of the annotated portions 
in a new version). In such cases, the user checking out the document may be 
notified that there are pending changes to be made for annotations related to the 
document. Further, as previously described, annotations for such a document may 
be marked as unverified while the changes are pending. 

CONCLUSION 

[0064] By supporting multiple policies for applying annotations to different 
versions of documents, users may be able to select a policy that best fits their needs 
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and gain some level of assurance the annotations will be properly applied as the 
document changes. For systems utilizing a content management system, enforcing 
such policies when a document is checked in may facilitate detecting changes to 
documents, or annotated portions thereof, and allow corresponding annotation 
indexes to be updated to reflect changes to the document. As a result, multiple 
users accessing a shared set of documents may be assured that annotations for the 
documents are valid and apply to the current versions of the documents. 

[0065] While the foregoing is directed to embodiments of the present invention, 
other and further embodiments of the invention may be devised without departing 
from the basic scope thereof, and the scope thereof is determined by the claims that 
follow. 
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